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ABSTRACT 

A technique is presented for obtaining a control law to 
regulate the modal dynamics and identify the modal parameters 
of a flexible structure. The method is based on using a 
min-max performance index to derive a control law which may 
be considered to be a best compromise between optimum 
one-step control and identification inputs. Features of the 
approach are demonstrated by a computer simulation of the 
controlled modal response of a flexible beam. 

I. INTRODUCTION 

A class of indirect adaptive control systems proposed for the 
control of large space structures [1] is based on a modal 
decomposition of the system dynamics and may incorporate one 
or more on-line testing schemes [2] to determine when 
successful parameter identification has been achieve. The 
control strategy used in calculating the actuator inputs must 
achieve adequate regulation or tracking performance and, at 
the same time, provide inputs to allow adequate parameter 
identification. A ont*"ol system designer is thus faced with 
the problem of ■'••'vising a control strategy to ensure 
acceptable system performance even when on-line parameter 
identi f iabil i ty tests have failed because the system 
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configuration has changed or the environment In which the 
system operates has changed. 

In this paper we formulate and examine the performance of a 
nonlinear dual -adaptl ve control scheme In which a 
sampled-data controller Is designed to select a best 
compromise between an Input signal that Is optimum for 
mean-square system regulation and an Input signal that Is 
optimum for parameter Identification. Dual control theory, 
originally formulated by Feldbaum [3,4], has been studied In 
[5-7] and in the references cited therein. A key concept 
introduced by Feldbaum is the dual control strategy based on 
a performance Index that takes Into account the fact that 
future observations on the process will be made. A 
controller may be able to "probe" the system for state and 
parameter estimation Improvement, which then may improve 
future regulation and tracking performance. In many 
situations where the dual nature of stochastic control Is not 
taken into account the controller becomes "cautious" [5,6] 
and tends to "turn-off". This undesirable phenomenon Is 
avoided by the approach described below. 

II. FORMULATION OF AN ADAPTIVE PERFORMANCE INDEX 


The dl screte-tlme dynamics for each mcde Is assumed to be 
described by the ARMA model 


y( t)+aiy( t-l) + a 2 y( t-2) = biu( t-l)+b 2 u(t- 2 )+e(t) (1) 


where y ( t ) denotes modal displacement, u(t) denotes modal 
force, and e(t) Is a sequence of Independent, 
equally-distributed, normal ( 0 , 0 ?) random variables. It Is 
assumed that e(t) Is Independent of y( t-1) ,y( t-2) , . . . , 
u( t-1) ,u( t-2) , . . . and that the parameters a 1 .a 2 .b 1 .b 2 
are unknown constants. If we let Yt denote the Information 
available to the controller at time t. 


Yt ■ |y(t),y(t-l),..., u( t- 1) ,u( t-2) , . . . | 

x ( t > denote the modal parameter vector and e(t) denote ,a 
modal measurement vector, 

x T (t) = (ai ,a2 ,bi,b 2 ) ; 

®T( t ) « ( -y( t-1 ) ,-y { t-2) ,u( t-1) ,u( t-2) 

where ( . ) T denotes vector or matrix transpose, then (1) 
may be rewritten as 

y ( t ) = e T ( t)x( t)+e( t) 


( 2 ) 


(3) 


( 4 ) 
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where the constant parameter "dynamics" satisfies 


x(t + l) * x ( t ) 


( 5 ) 


It can than be shown, following the analysis of [8], that the 
conditional distribution of x(t+2) given Y^+i Is normal 
with mean x(t+2) and covariance matrix P(t+2) where x(t) and 
P(t) satisfies the difference equations 

x(t+l) * x( t) + K( t) (y( t)-e T ( t ) x ( t ) ) 

KC t) - P(t)e(t)/(a2*eT(t)P(t)e{t)) 

P(tn) = P(t)-(P(t)e(t)eT(t)P(t))/ 

( o 2 +eT ( t)P{ t)e{ t) ) 

Furthermore, the control law that minimizes the regulation 
crl terl on 

V c ( u( t) ) * E{y2(t+l)IY t } 

Is given by 


( 6 ) 

(7) 

( 8 ) 


(9) 


u ( t ) = 


x 1 (t+l)x 3 (t*l)+P 3i (t+l))e 1 (t*l) 

x|(t+l)+P 33 (t+l) 


( 10 ) 


where denotes the sum over 1 • 1 to 4 with the value 3 
excluded. 

To provide bounded modal Inputs that Improve parameter 
Identification accuracy while guaranteeing that the modal 
amplitude will not become excessively large, the controller Is 
designed to optimize, at each sampling Instant t, the following 
performance criterion: 


ml n max C V ( x, u( t) ) ] 
u( t) x 

subject to the constraints 
u(t) ^M, 0 £ X £ 1 

where 


V( x, u{ t) ) 


V c (u(t)) 


+ (1-x) 


Vj(u(t)) 


(ID 


(12) 


(13) 
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o 

V c denotes an acceptable or desired level of regulation cost. 
Vj(u(t)) denotes and identification cost function of u(t), 

V I ( u( t) ) = trace £p ( t+2 )J (14) 

Q 

Vj denotes and acceptable or desired level of Identification 

cost. The maximization Indicated In (11) yields a function 
V ( u ( t ) ) which, although not convex. Is Interpreted as 
specifying, for each admissible u(t), the most costly linear 
combination of relative regulation and relative identification 
cost. Minimization of V(u) thus yields the modal Input that 
minimizes this most costly combination of relative 
identification and regulation performance. 

III. SIMULATION RESULTS 

Since V c (u(t)) and trace P(t + 2) are relatively simple 
functions of u(t) the numerical solution of the one-step 
optimization problem (11)-(13) at each sampling time Is quite 
feasible. Results of simulation studies described below 
illustrate an Interesting feature of this approach: since the 

parameters Involved In the evaluation of V c (u(t)) and 
Vi(u(t)) depend on system measurements, the optimum 
distribution of relative cost, x (u) depends on on-line 
measurement data and hence, at each sampling Instant, the 
weighting between identification and regulation will change 
depending on the on-line system performance. This Is In 
contrast to[9] In which a fixed weighting between absolute 
control and Identification cost Is used at each sample time. 

In the simulation study we compare the performance of three 
control systems: 

a) A constrained adaptive controller that minimizes (9) 
subject to the control magnitude constraint. 

b) An optimum Identification controller that minimizes 
(14) subject to the control magnitude constraint. 

c) The one-step dual -adaptl ve controller based on 
( 1 1 ) - ( 13) . 

In Figures 1-3 we present simulated modal response data for the 
first flexible mode of the Langley beam experiment described In 
[10] where we assume here that a single actuator Is used. The 
accumulated on-line regulation cost, VT, shown In Figure 1 Is 
defined as 


VT('l) 


£ 


rno 


(15) 


120 



and the on-line Identification cost, PT, Is defined as 


PT( M) * trace [P(N)3 (16) 

where P ( M ) Is calculated on-line using (8). Mote that for the 
first 10 to 15 sampling times the regulation cost of the 
dual-adaptive controller Is close to that of the constrained 
minimum-variance controller and the Identification cost of the 
dual-adaptive control system Is close to that of the 
constrained one-step optimum Identification controller. 

Figure 2 Indicates that the dual-adaptive controller's actuator 
signals switch between Its limits, ^0.5, more frequently than 
do the actuator signals of the other controllers. This may be 
due to the lack of any energy constraint In the above problem 
formulation. 

A future study will examine the performance of the 
energy-constrained dual -adaptl ve controller In comparison with 
energy-constrained minimum-variance and one-step optimum 
Identification controllers. The relative regulation cost and 
relative Identification cost defined In (13) are plotted In 
Figure 3 where 

V c (N) = a 2 N (17) 

Is the accumulated control cost that would be achieved If the 
parameters of the system where known precise <y and If an 
unconstrained control law were used; o? > 10~* was used In 
the simulation runs. A constant value Yj 0 > 10-4 was chosen 
as Indicating the acceptable level of parameter 
Identification. Figure 3 Indicates that, depending on on-line 
measurements, the one-step Identification and regulation cost 
at one sampling Instant can have widely differing shapes from 
their respective distributions at other sampling times. This 
leads to the on-line variations In the dual-adaptive control 
strategy mentioned earlier. 

The simulation results Indication that the one-step, 
constrained dual-adaptive controller has the feature of 
providing, based on measured data, system Inputs that result In 
parameter Identification while maintaining bounded modal 
amplitude response. 


121 



REFERENCES 


Montgomery, R. C. and Thau, F.E., "Adaptive and Learning 
Control of Large Space Structures," AIAA Guidance and 
Control Conference, June, 1980. 

Sundararajan, N. and Montgomery, R. C., "Adaptive Control 
of a Flexible Beam Using Least Square Lattice Filters," 
1983 Large Space Structures Workshop, Blacksburg, Va. 

Feldbaum, A. A., "Dual Control Theory I - IV, Automation 
and Remote Control I960, vol . 21, pp. 874, pp. TUI'!;' JT967, 
vol. 22, pp. 1, pp. 109. 

Feldbaum, A. A., Optimal Control Systems, Academic Press, 
New York, 1965. 

Bar-Shalom, Y., "Stochastic Dynamic Programming: Caution 

and Probing, " IEEE Trans., Vol. AC - 2o, No. 5, Oct. 

1981, pp, 1184 - 1195. 

Wlttenmark, B., "Stochastic Adaptive Control Methods: A 

Survey, "Int. J. Control, Vol. 21, No. 5, pp. 705 - 730, 
1975. 

Bar-Shalom, Y. and Tse, £., "Dual Effect, Certainty 
Equivalence, and Separation In Stochastic Control, 

" IEEE Trans ., Vol. AC - 19, pp. 494 - 500, Oct. 1974. 

Astrom, K. J. and Wlttenmark, B., "Problems of 
Identification and Control," J. Math. Anal and Appl , 

Vol. 34, pp. 90 - 113, 1971 . 

Wlttenmark, B., "An Active Suboptlmal Dual Controller for 
Systems with Stochastic Parameters," Auto. Contr. Theory 
Appl . , Canada, Vol. e, pp. 13-19, 1975. 

Thau, F. E., Montogomery, R. C., and Horner, G.C., 

"On-Line Structural Parameter Identification," Proc. AIAA 
Guidance and Control Conference, August, 1981, 

Albuquerque, New Mexico, pp. 5^0-539. 




0 10 20 30 40 j 50 0 10 20 30 40 50 

SAMPLE TIME SAMPLE TIME 


(a) ONE-STEP OPTIMUM IDENTIFICATION 

(b) MINIMUM-VARIANCE ADAPTIVE 

(c) DUAL-ADAPTIVE 


Fi 9 1. On-Line Regulation and Identification Cost 
for Three Feedback Controllers 
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Fig 3. Relative Control Cost RC - V c (u)/V® and Relative 
Identification Cost RI = Vj(u)/V| for 
3 Sample Times 
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